Knowledge Sharing from Domain-specific Documents

نویسندگان

  • Eiko Yamamoto
  • Hitoshi Isahara
  • Akira Terada
  • Yasunori Abe
چکیده

Recently, collaborative discussions based on the participant generated documents, e.g., customer questionnaires, aviation reports and medical records, are required in various fields such as marketing, transport facilities and medical treatment, in order to share useful knowledge which is crucial to maintain various kind of securities, e.g., avoiding air-traffic accidents and malpractice. We introduce several techniques in natural language processing for extracting information from such text data and verify the validity of such techniques by using aviation documents as an example. We automatically and statistically extract from the documents related words that have not only taxonomical relations like synonyms but also thematic (non-taxonomical) relations including causal and entailment relations. These related words are useful for sharing information among participants. Moreover, we acquire domain-specific terms and phrases from the documents in order to pick up and share important topics from such reports.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

Content-based Retrieval of Analytical Reports

Analytic reports are special textual documents containing condensed results from a data mining process. Embedded knowledge enables the interpretation of the reports by automated procedures, which opens the way to content-based retrieval. We elaborate the technique for statistical association rules as specific form of discovered knowledge, demonstrate its formal apparatus on examples from the me...

متن کامل

Conceptual modelling for domain specific document description and retrieval - An approach to semantic document modelling

Organisations and individuals today are exposed to vast numbers of documents in their daily work. Modern retrieval techniques, knowledge management systems, the Semantic Web initiative and several related efforts all strive to improve information sharing and to arrive at languages, methods and tools for semantic document retrieval. Approaches from conceptual modeling have not been widely applie...

متن کامل

Extraction of Informative Expressions from Domain-specific Documents

What kinds of lexical resources are helpful for extracting useful information from domain-specific documents? Although domain-specific documents contain much useful knowledge, it is not obvious how to extract such knowledge efficiently from the documents. We need to develop techniques for extracting hidden information from such domain-specific documents. These techniques do not necessarily use ...

متن کامل

XML-Hoo! A Prototype Application for Intelligent Query of XML Documents Using Domain-Specific Ontologies

Use of XML holds great promise for standardizing data models for realizing benefits such as lowered development costs and time for integrating inter-organizational business processes and intra-organizational knowledge management. Further benefits can be realized by formally defining common semantics in ontologies using the standardized models. Automation of business processes that require shari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017